Robust Identification of Noncoding RNA from Transcriptomes Requires Phylogenetically-Informed Sampling
Authors
Abstract
Noncoding RNAs are integral to a wide range of biological processes, including translation, gene regulation, host-pathogen interactions and environmental sensing. While genomics is now a mature field, our capacity to identify noncoding RNA elements in bacterial and archaeal genomes is hampered by the difficulty of de novo identification. The emergence of new technologies for characterizing transcriptome outputs, notably RNA-seq, is improving noncoding RNA identification and expression quantification. However, a major challenge is to robustly distinguish functional outputs from transcriptional noise. To establish whether annotation of existing transcriptome data has effectively captured all functional outputs, we analysed over 400 publicly available RNA-seq datasets spanning 37 different Archaea and Bacteria. Using comparative tools, we identify close to a thousand highly-expressed candidate noncoding RNAs. However, our analyses reveal that the capacity to identify noncoding RNA outputs is strongly dependent on phylogenetic sampling. Surprisingly, and in stark contrast to protein-coding genes, the phylogenetic window for effective use of comparative methods is perversely narrow: aggregating public datasets produced only one phylogenetic cluster where these tools could be used to robustly separate unannotated noncoding RNAs from a null hypothesis of transcriptional noise. Our results show that for the full potential of transcriptomics data to be realized, a change in experimental design is paramount: effective transcriptomics requires phylogeny-aware sampling.
Similar resources
Study of Long Noncoding RNA FER1L4 and RB1, as Its Competing Endogenous RNA Network Target Gene, in Breast Cancer
Introduction: Breast cancer is the second most common cause of cancer-related death among females, which requires an exploration for markers to propose a more specific categorization of this cancer. Long noncoding RNAs (lncRNAs), the main subset of noncoding transcripts, are involved in tumorigenic processes. In this study, we investigated the expression of the fer-1–like family member 4 (FER...
Dev105858 2325..2330
Differential gene expression is a prerequisite for the formation of multiple cell types from the fertilized egg during embryogenesis. Understanding the gene regulatory networks controlling cellular differentiation requires the identification of crucial differentially expressed control genes and, ideally, the determination of the complete transcriptomes of each individual cell type. Here, we have an...
Dev105858 1..6
Differential gene expression is a prerequisite for the formation of multiple cell types from the fertilized egg during embryogenesis. Understanding the gene regulatory networks controlling cellular differentiation requires the identification of crucial differentially expressed control genes and, ideally, the determination of the complete transcriptomes of each individual cell type. Here, we have an...
An Rrp6-like protein positively regulates noncoding RNA levels and DNA methylation in Arabidopsis.
Rrp6-mediated nuclear RNA surveillance tunes eukaryotic transcriptomes through noncoding RNA degradation and mRNA quality control, including exosomal RNA decay and transcript retention triggered by defective RNA processing. It is unclear whether Rrp6 can positively regulate noncoding RNAs and whether RNA retention occurs in normal cells. Here we report that AtRRP6L1, an Arabidopsis Rrp6-like pr...
Principles of long noncoding RNA evolution derived from direct comparison of transcriptomes in 17 species.
The inability to predict long noncoding RNAs from genomic sequence has impeded the use of comparative genomics for studying their biology. Here, we develop methods that use RNA sequencing (RNA-seq) data to annotate the transcriptomes of 16 vertebrates and the echinoid sea urchin, uncovering thousands of previously unannotated genes, most of which produce long intervening noncoding RNAs (lincRNA...
Journal title:
Volume 10, Issue
Pages -
Publication date: 2014